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Abstract — We describe two general mechanisms for producing pair- 
ing bijections (bijective functions defined from — !> N). The first 
mechanism, using ?^-adic valuations results in parameterized algorithms 
generating a countable family of distinct pairing bijections. The second 
mechanism, using characteristic functions of subsets of N provides 
2^* distinct pairing bijections. Mechanisms to combine such pairing 
functions and their application to generate families of permutations of 
M are also described. The paper uses a small subset of the functional 
language Haskell to provide type checked executable specifications of 
all the functions defined in a literate programming style. The self- 
contained Haskell code extracted from the paper is available at http: 
/Aogic.cse.unt.edu/tarau/research/2012/infpair.hs . 

Keywords: pairing / unpairing functions, data type isomorphisms, infinite 
data objects, lazy evaluation, functional programming. 



I. Introduction 

Definition 1: A pairing hijection is a bijection / : N x N — >■ N. 
Its inverse is called an unpairing bijection. 

We are emphasizing here the fact that these functions are bijections 
as the name pairing function is sometime used in the literature to 
indicate injective functions from N x N to N. 

Pairing bijections have been used in the first half of 19-th century 
by Cauchy as a mechanism to express duble summations as simple 
summations in series. They have been made famous by their uses in 
the second half of the 19-th century by Cantor's work on foundations 
of set theory. Their most well known application is to show that 
infinite sets like N and N x N have the same cardinality. A classic use 
in the theory of recursive functions is to reduce functions on multiple 
arguments to single argument functions. Reasons on why they are an 
interesting object of study in terms of practical applications ranging 
from multi-dimensional dynamic arrays to proximity search using 
space filling curves are described in ||6J, I^TJ, | ,14J . 

Like in the case of Cantor's original function f{x,y) — |(a; + 
y){x + y + 1) + y, pairing bijections have been usually hand-crafted 
by putting to work geometric or arithmetic intuitions. 

While it is easy to prove (non-constructively) that there is an 
uncountable family of distinct pairing bijections, we have not seen 
in the literature general mechanisms for building families of pairing 
bijections indexed by N or 2^*. It is even easier to generate (construc- 
tively) a countable family of pairing functions simply by modifying 
its result of a fixed pairing function with a reversible operation (e.g 
XOR with a natural number, seen as the index of the family). 

This paper introduces two general mechanisms for generating 
such families, using n-adic valuations (section |ll| and characteristic 
functions of subsets of N (section [lll|, followed by a discussion of 
related work (section [IV| and our conclusions (section [Vf. 

We will give here a glimpse of why our arguably more complex 
pairing bijections are interesting. 

The n-adic valuation based pairing functions will provide a general 
mechanism for designing strongly asymmetric pairing functions, 
where changes in one of the arguments have an exponential impact 
on the result. 



The characteristic-function mechanism, while intuitively obvious, 
opens the doors, in combination with a framework providing bijec- 
tions between them and arbitrary data- types [18], to custom-build 
arbitrarily intricate pairing functions associated to for instance to 
"interesting" sequences of natural numbers or binary expansions of 
1 15 1 real numbers. 

We will use a subset of the non-strict functional language Haskell 
(seen as an equational notation for typed A-calculus) to provide 
executable definitions of mathematical functions on N, pairs in N x N, 
subsets of N, and sequences of natural numbers. We mention, for the 
benefit of the reader unfamiliar with the language, that a notation 
like f X Y stands for f{x,y), [t] represents sequences of type 
t and a type declaration like f :: s->t->u stands for a 
function / : s x t — >■ u (modulo Haskell's "currying" operation, 
given the isomorphism between the function spaces s x t — >■ it and 
s — )■ f — )■ It). Our Haskell functions are always represented as sets 
of recursive equations guided by pattern matching, conditional to 
constraints (simple arithmetic relations following | and before the 
= symbol). Locally scoped helper functions are defined in Haskell 
after the where keyword, using the same equational style. The 
composition of functions f and g is denoted f . g. It is also 
customary in Haskell, when defining functions in an equational style 
(using =) to write f = g instead of f x — g x ("point-free" notation). 
The use of Haskell's "call-by-need" evaluation allows us to work with 
infinite sequences, like the [ . . ] infinite list notation, corresponding 
to the set N itself 

II. Deriving Pairing Bijections from n-ADic valuations 

We first overview a mechanism for deriving pairing bijections from 
one-solution Diophantine equations. Let us observe that 

Proposition 1: Mz £ f^'^ = N — {0} the Diophantine equation 

2"(2j/ + l) = 2 (1) 

has exactly one solution x, y G N. 

This follows immediately from the unicity of the decomposition of 
a natural number as a multiset of prime factors. Note that a slight 
modification of equation [T] results in the pairing bijection originally 
introduced in js], jsj, seen as a mapping between the pair [x, y) and 
z. 

2^{2y + l)-l = z (2) 

We will generalize this mechanism to obtain a family of bijections 
between N x N and (and the corresponding pairing bijections 
between N x N and N) by choosing an arbitrary base b instead of 2. 

Definition 2: Given a number n G N, n > 1, the n-adic valuation 
of a natural number m is the largest exponent k of n, such that n'' 
divides m. It is denoted Un{m). 

Note that the solution x of the equation ([TJ is actually V2{z). This 
suggest deriving similar Diophantine equations for an arbitrary n-adic 
valuation. We start by observing that the following holds: 

Proposition 2: V6 G N, 6 > 1, Vy G N if 3g, m such that h > 
m > 0, y = 6g + m, then there's exactly one pair (y', m'), fe — 1 > 
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m' > such that y' — (& — l)g + m' and the function associating 
{y',m') to {y,rn) is a bijection. 

Proof: y = bq + nijb > m > can be rewritten as ?/ — g — 1 = 
6g — Q + m — 1, 6 > m > 0, or equivalently y — g— 1 = (6— l)q + 
(m — l),b > m > from where it follows that setting y' = y — q—1 
and m' — m~l ensures the existence and unicity of y' and m' such 
that y' — {b— l)q + m' and 6— 1 > m' > 0. We can therefore define 
a function / that transforms a pair {y,m), such that y = bq + m 
with & > m > 0, into a pair {y' , m'), such that y' — q(b — 1) + m' 
with 6 — 1 > m' > 0. Note that the transformation works also in the 
opposite direction with y' — y — q — 1 giving y — y' + q + 1, and 
with m' — m — 1 giving m — m' + 1. Therefore / is a bijection. ■ 
Proposition 3: Vb G N, b > 1, € N,z > the system of 
Diophantine equations and inequations 

b" * (y' + g + 1) = z (3) 

y' = {b-l)q + m' (4) 

b - 1 > m' > (5) 

has exactly one solution x, y' £ N. 

Proof: Let be the inverse of the bijection / defined in 
Proposition [2] Then /^^ provides the desired unique mapping, that 
gives y = y' + g+l and m — m' — 1 such that 6 > m > 0. Therefore 
y = m {mod b) with m > 0. And as y is not divisible with b, we can 
determine uniquely x as the largest power of b dividing z, x = ui,{z). 

■ 

We implement, for and arbitrary b G N, the Haskell code cor- 
responding to these bijections as the functions nAdicCons b and 
nAdicDeCons b, defined between N x N and N'^ . 

nAdicCons :: N->(N,N)-5>N 

nAdicCons b (x, y' ) | b>l = (b~x) *y where 

q = y' Miv' (b-1) 

y = y'-Kjfl 

nAdicDeCons :: N-s>N-> (N,N) 

nAdicDeCons b z | b>l && z>0 = (x, y') where 

hd n = if n 'mod' b > then else 14hd (n 'div' b) 
X = hd z 

y = z 'div' (b"x) 
q = y 'div' b 

y' = y-q-i 

Using nAdicDeCons we define the head and tail projection func- 
tions nAdicHead and nAdicTail: 

nAdicHead, nAdicTail : : N— 5>N 
nAdicHead b = f st . nAdicDeCons b 
nAdicTail b = snd . nAdicDeCons b 

The following examples illustrate the operations for base 3: 

*InfPair> nAdicCons 3 (10,20) 
1830519 

*InfPair> nAdicHead 3 1830519 
10 

*InfPair> nAdicTail 3 1830519 
20 

Note that nAdicHead n x computes the n-adic valuation of x, 
v„{x) while the tail corresponds to the "information content" ex- 
tracted from the remainder, after division by Vnix). 

Definition 3: We call the natural number computed by 
nAdicHead n x the n-adic head of x G N"'", by nAdicTail n 
X the n-adic tail of a; G N""" and the natural number in computed 
by nAdicCons n (x,y) the n-adic cons of a;, y G N. 

By generalizing the mechanism shown for the equations [T| and |2] 
we derive from nAdicDeCons and nAdicCons the corresponding 
pairing and impairing bijections nAdicPair and nAdicUnPair: 



nAdicUnPair :: N-^N--s> (N, N) | 
nAdicUnPair b n = nAdicDeCons b (n-fl) 

nAdicPair :: N— 5>(N,N)->N 

nAdicPair b xy = (nAdicCons b xy) -1 I 

One can see that we obtain a countable family of bijections fb : 
N X N N indexed by b G N, b > 1. 

The following examples illustrate the work of these bijections for 
b — 3. Note the use of Haskell's higher-order function "map", that 
applies the function nAdicUnPair 3 to a list of elements and 
collects the results to a list, and the special value "it", standing 
for the previously computed result. 

*InfPair> map (nAdicUnPair 3) [0..7] 

[(0,0), (0,1), (1,0), (0,2), (0,3), (1,1), (0,4), (0,5)] 

*InfPair> map (nAdicPair 3) it 

[0,1,2,3,4,5,6,7] 

1) Deriving bijections between N and [N]: For each base b>l, 
we can also obtain a pair of bijections between natural numbers and 
lists of natural numbers in terms of nAdicHead, nAdicTail and 
nAdicCons: 

nat2nats : : N--s>N--s> [N] 
nat2nats _ = [] 
nat2nats b n | ri>0 = 

nAdicHead b n : nat2nats b (nAdicTail b n) 

nats2nat : : N--s> [N] --s>N 
nats2nat _ [ ] = 

nats2nat b (x:xs) = nAdicCons b (x,nats2nat b xs) 

The following example illustrate how they work: 

*InfPair> nat2nats 3 2012 
[0,2,2,0,0,0,0] 

*InfPair> nats2nat 3 it 
2012 

Using the framework introduced in f 16| , |18| and summarized in 
the Appendix, we can "reify" these bijections as Encoders between 
natural numbers and sequences of natural numbers (parameterized by 
the first argument of nAdicHead and nAdicTail). Such Encoders 
can now be "morphed", by using the bijections provided by the 
framework, into various data types sharing the same "information 
content" (e.g. lists, sets, multisets). 

nAdicNat : : N— ^Encoder N 1 
nAdicNat It = Iso (nat2nats k) (nats2nat k) \ 

In particular, for fc = 2, we obtain the Encoder corresponding 
to the Diophantine equation ^ 

nat : : Encoder N 1| 
nat = nAdicNat 2 |' 

The following examples illustrate these operations, lifted through the 
framework defining bijections between datatypes, given in Appendix. 

*InfPair> as (nAdicNat 3) list [2,0,1,2] 
873 

*InfPair> as (nAdicNat 7) list [2,0,1,2] 
27146 

*InfPair> as nat list [2,0,1,2] 
300 

*InfPair> as list nat it 
[2,0,1,2] 

2) Deriving new families of Encoders and Permutations ofN: For 
each l,k £ N one can generate a family of permutations (bijections 
f : N N), parameterized by the pair (l,k) , by composing 

nat2nats 1 and nats2nat k. 



3 



nAdicBij : : N — > N — > N — > N 

nAdicBij k 1 = (nats2nat 1) . (nat2nats k) ^^^^^^^^^^^^ 

The following example illustrates their work on the initial segment 
[0. .31] of N: 

*InfPair> map {nAdicBij 2 3) [0..31] 

[0,1, 3, 2, 9, 5, 6, 4, 27, 14, 15, 8, 18, 10, 12, 7, 81, 41, 42, 

22, 45, 23, 24, 13, 54, 28, 30, 16, 36, 19, 21, 11] 
*InfPair> map (nAdicBij 3 2) [0..31] 
[0,1, 3, 2, 7, 5, 6, 15, 11, 4, 13, 31, 14, 23, 9, 10, 27, 63, 

12, 29, 47, 30, 19, 21, 22, 55, 127, 8, 25, 59, 26, 95] 

It is easy to see that the following holds: 
Proposition 4: 

{nAdicBij k I) o {nAdicBij I k) = id (6) 

As a side note, such bijections might have applications to cryptogra- 
phy, provided that a method is devised to generate "interesting" pairs 
(k,l) defining the encoding. 

We can derive Encoders representing functions between N 
and sequences of natural numbers, parameterized by a (possibly 
infinite) list of nAdicHead / nAdicTail bases, by repeatedly 
applying the n-adic head, tail and cons operation parameterized by 
the (assumed infinite) sequence ks: 

nAdicNats : : [N] — !>Encoder N 

nAdicNats ks = Iso (nat2nAdicNats ks) (nAdicNats2nat ks) 

nat2nAdicNats : : [N] -5>N-5> [N] 
nat2nAdicNats _ = [] 
nat2nAdicNats (k:ks) n | rOO = 

nAdicHead k n : nat2nAdicNats ks (nAdicTail k n) 

nAdicNats2nat : : [N] ^ [N] -)-N 

nAdicNats2nat _ [] = 
nAdicNats2nat (k:ks) (x:xs) = 

\ll^m^mMmmiSa^eumi!m^^ ^f«''y'«^i'^'mMmafmm I ■mrwf 

For instance, the Encoder nat' corresponds to ks defined as the 
infinite sequence starting at 2. 

nat' : : Encoder N 

nat' = nAdicNats [2. .] ^ 

The following examples illustrate the mechanism: 

*InfPair> as nat' list [2,0,1,2] 
1644 

*InfPair> as list nat' it 
[2,0,1,2] 

*InfPair> map (as nat' nat) [0..15] 
[0,1, 2, 3, 4, 7, 6, 5, 8, 19, 14, 15, 12, 13, 10, 9] 
*InfPair> map (as nat' nat) [0..15] 
[0,1, 2, 3, 4, 7, 6, 5, 8, 19, 14, 15, 12, 13, 10, 9] 

Note that functions like as nat' nat illustrate another general 
mechanism for defining permutations of N. 

III. Pairing bijections derived from characteristic 

FUNCTIONS OF SUBSETS OF N 

We start by connecting the bitstring representation of characteristic 
functions to our bijective data transformation framework (overviewed 
in the Appendix). 

A. The bijection between lists and characteristic functions of sets 

The function list2bins converts a sequence of natural numbers 
into a characteristic function of a subset of represented as a string 
of binary digits. The algorithm interprets each element of the list 
as the number of digits before the next 1 digit. Note that infinite 
sequences are handled as well, resulting in infinite bitstrings. 



Iist2bins : : [N] [N] 

list2bins [] = [0] 
list2bins ns = f ns where 
f [] = [] 

f (x:xs) = (repl x 0) -H- (l:f xs) where 
repl n a I n < = [] 

The function bin21ist converts a characteristic function repre- 
sented as bitstrings back to a list of natural numbers. 

bins21ist : : [N] [N] 
bins21ist xs = f xs where 

f [] _= [] 

f (0:xs) k = f xs (k+1) 
f (l:xs) k = k : f xs 

Together they provide the Encoder bins, that we will use to connect 
characteristic functions to various data types. 

bins : : Encoder [N] 

bins = Iso bins21ist list2bins 

The following examples (where the Haskell library function take 
is used to restrict execution to an initial segment of an infinite Ust) 
illustrate their use: 

*InfPair> list2bins [2,0,1,2] 

[0,0,1,1,0,1,0,0,1] 
*InfPair> bins21ist it 
[2,0,1,2] 

*InfPair> take 20 (list2bins [0,2..]) 
[1,0,0,1,0,0,0,0,1,0,0,0,0,0,0,1,0,0,0,0] 
*InfPair> bins21ist it 
[0,2,4,6] 

The following holds: 

Pwpoji'ft'on 5.- If M is a subset of N, the bijection as bins set 
returns the bitstring associated to M and its inverse is the bijection 
as set bins. 

Proof: Observe that the transformations are the composition of 
bijections between bitstrings and lists and bijections between lists 
and sets. ■ 
The following example illustrates this correspondence: 

*InfPair> as bins set [0,2,4,5,7,8,9] 
[1,0,1,0,1,1,0,1,1,1] 

*InfPair> as set bins it 
[0,2,4,5,7,8,9] 

Note that, for convenient use on finite sets, the functions do not 
add the infinite stream of digits indicating its infinite stream of 
non-members, but we will add it as needed when the semantics of 
the code requires it for representing accurately operations on infinite 
sequences. We will use the same convention through the paper. 

B. Splitting and merging bitstrings with a characteristic function 

Guided by the characteristic function of a subset of I'sJ, represented 
as a bitstring, the function bsplit separates a (possibly infinite) 
sequence of numbers into two Usts: members and non-members. 

bsplit : : [N] ^ [N] ( [N] , [N] ) 
bsplit _ [] = ([], []) 
bsplit [] (n:ns) = 

error ("bspilt provides no guidance at: "-|-|-(show n) ) 
bsplit (0:bs) (n:ns) = (xs,n:ys) where 

(xs,ys) = bsplit bs ns 
bsplit (l:bs) (n:ns) = (n:xs,ys) where 
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Guided by the characteristic function of a subset of N, represented 
as a bitstring, the function bmerge merges two lists of natural 
numbers into one, by interpreting each 1 in the characteristic function 
as a request to extract an element of the first list and each as a 
request to extract an element of the second list. 

bmerge : : [N] -> ( [N] , [N] ) [N] 

bmerge _ ([],[]) = [] 

bmerge bs ( [ ] , [y] ) = [y] 

bmerge bs ( [x] , [ ] ) = [x] 

bmerge bs ([],ys) = bmerge bs ([0],ys) 

bmerge bs (xs, [] ) = bmerge bs (xs, [0] ) 

bmerge (0:bs) (xs,y:ys) =y : bmerge bs (xs,ys) 

bmerge (l:bs) (x:xs,ys) = x : bmerge bs (xs,ys) 

The following examples (trimmed to finite lists) illustrate their use: 

*InfPair> bsplit [0,1,0,1,0,1] [10,20,30,40,50,60] 

( [20, 40, 60] , [10, 30, 50] ) 

*InfPair> bmerge [0,1,0,1,0,1] it 

[10, 20, 30, 40, 50, 60] 

C. Defining pairing bijections, generically 

We design a generic mechanism to derive pairing functions by 
combining the data type transformation operation as with the 
bsplit and bmerge functions that apply a characteristic function 
encoded as a list of bits. 

genericUnpair : : Encoder t — > t — > N — > (N, N) 
genericUnpair xEncoder xs n = (l,r) wlriere 

bs = as bins xEncoder xs 

ns = as bins nat n 
(Is, rs) = bsplit bs ns 

1 = as nat bins Is 
. r = as nat bins rs 

genericPair : : Encoder t — > t — (N, N) — > N 
genericPair xEncoder xs (l,r) = n wliere 

bs = as bins xEncoder xs 

is = as bins nat 1 

rs = as bins nat r 

ns = bmerge bs (Is, rs) 

n = as nat bins ns 

Let us observe first that for termination of this functions depends 
on termination of the calls to bsplit and bmerge, as illustrated 
by the following examples: 

*InfPair> genericPair bins (cycle [0]) (10,20) 
'CInterrupted. 

*InfPair> genericUnpair bins (cycle [1]) 42 
( "CInterrupted. 

In this case, the characteristic functions given by cycle [0] or 
cycle [ 1 ] would trigger an infinite search for a non-existing first 
1 or in bsplit and bmerge. 

Clearly, this suggests restrictions on the acceptable characteristic 
functions. 

We will now give sufficient conditions ensuring that the functions 
genericUnpair and genericPair terminate for any values of 
their last arguments. Such restrictions, will enable them to define 
families of pairing functions parameterized by characteristic functions 
derived from various data types. 

Definition 4: We call bloc of digits occurring in a characteristic 
function any (finite or infinite) contiguous sequence of digits. 
Note that an infinite bloc made entirely of (or 1) digits can only 
occur at the end of the sequence defining the characteristic function, 
i.e. only if it exists a number n such that the index of each member 
of the bloc is larger than n. 



Proposition 6: If {a„}„gN is an infinite sequence of bits con- 
taining only finite blocks of and 1 digits, genericPair bins 
and genericUnPair bins define a family of pairing bijections 
parameterized by {a„}„gN. 

Proof: Having an alternation of finite blocks of Is 
and Os, ensures that, when called from genericPair and 
genericUnPair, the functions bmerge and bsplit terminate. 

■ 

For instance, Morton codes [4j are derived by using a stream of 
alternating 1 and digits (provided by the Haskell library function 
cycle) 

bunpair2 = genericUnpair bins (cycle [1,0]) 
bpair2 = genericPair bins (cycle [1,0]) 

and working as follows: 

*InfPair> map bunpair2 [0..10] 
[ (0, 0) , (1,0) , (0, 1) , (1, 1) , (2,0) , 

(3,0), (2,1), (3,1), (0,2), (1,2), (0,3)] 
*InfPair> map bpair2 it 
[0,1,2,3,4,5,6,7,8,9,10] 

Proposition 7: If {a„}„gN is an infinite sequence of non- 
decreasing natural numbers, the functions genericPair set and 
genericUnPair set define a family of pairing bijections param- 
eterized by {a„}„gN. 

Proof: Given that the sequence is non-decreasing, it represents 
canonically an infinite set such that its complement is also infinite, 
represented as a non-decreasing sequence. Therefore, the associated 
characteristic function will have an alternation of finite blocks of 1 
and digits, inducing a pairing/unpairing bijection. ■ 

The bijection bpair k and its inverse bunpair k are derived 
from a set representation (implicitly morphed into a characteristic 
function). 

bpair k = genericPair set [0,k. .] ^ 
bunpair k = genericUnpair set [0,1<. . ] I 

Note that for Ic = 2 we obtain exactly the bijections bpair2 
and bunpair 2 derived previously, as illustrated by the following 
example: 

*InfPair> map (bunpair 2) [0..10] 

[ (0,0) , (1,0) , (0, 1) , (1, 1) , (2,0) , (3,0) , 

(2, 1) , (3, 1) , (0,2) , (1,2) , (0,3) ] 
*InfPair> map (bpair 2) it 
[0,1,2,3,4,5,6,7,8,9,10] 

We conclude with a similar result for lists: 

Proposition 8: If {an} new is an infinite sequence of natural num- 
bers only containing finite blocks of Os, the functions genericPair 
list and genericUnPair list define a family of pairing 
bijections parameterized by {a„}„gN. 

Proof: It follows from Prop.|7]by observing that such sequences 
are transformed into infinite sets represented as non-decreasing se- 
quences. ■ 
The Appendix discusses a few more examples of such pairing 
functions and visualizes a few space-filling curves associated to them. 

Proposition 9: There are 2^ pairing functions defined using char- 
acteristic functions of sets of N. 

Proof: Observe that a characteristic function corresponding to a 
subset of N containing an infinite bloc of or 1 digits necessarily 
ends with the bloc. Therefore, by erasing the bloc we can put such 
functions in a bijection with a finite subset of I^. Given that there are 
only a countable number of finite subsets of I^, the cardinality of the 
set of the remaining subsets' characteristic functions is 2". ■ 
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IV. Related Work 

Pairing functions iiave been used in work on decision problems 
as early as |3|, |8|, ^-(13). There are about 19200 Google 
documents referring to the original "Cantor pairing function" among 
which we mention the surprising result that, together with the 
successor function it defines a decidable subset of arithmetic |2|. An 
extensive study of various pairing functions and their computational 
properties is presented in yj, y_4|. They are also related to 2D-space 
filling curves (Z-order, Gray-code and Hilbert curves) (4)-(7). Such 
curves are obtained by connecting pairs of coordinates correspond- 
ing to successive natural numbers (obtained by applying unpairing 
operations). They have applications to spatial and multi-dimensional 
database indexing ^4J-^7J and symbolic arbitrary length arithmetic 
computations (19| . Note also that bpair 2 and bunpair 2 are 
the same as the functions defined in f9| and also known as Morton- 
codes, with uses in indexing of spatial databases ^6J. 

V. Conclusion 

We have described mechanisms for generating countable and 
uncountable families of pairing / unpairing bijections. The mecha- 
nism involving n-adic valuations is definitely novel, and we have 
high confidence (despite of their obviousness) that the characteristic 
function-based mechanisms are novel as well, at least in terms of 
their connections to list, set or multiset representations provided by 
the implicit use of our bijective data transformation framework [16|. 

Given the space constraints, we have not explored the natural 
extensions to more general tupling / untupUng bijections (defined 
between N*" and N) as well as bijections between finite lists, 
sets and multisets that can be derived quite easily, using the data 
transformation framework given in the Appendix. For the same 
reasons we have not discussed specific applications of these families 
of pairing functions, but we foresee interesting connections with 
possible cryptographic uses (e.g "one time pads" generated through 
intricate combinations of members of these families). 

The ability to associate such pairing functions to arbitrary charac- 
teristic functions as well as to their equivalent set, multiset, list rep- 
resentations provides convenient tools for inventing and customizing 
pairing / unpairing bijections, as well as the related tupling / untupling 
bijections and those defined between natural numbers and sequences, 
sets and multisets of natural numbers. 

We hope that our adoption of the non-strict functional language 
Haskell (freely available from haskell.org I, as a complement to 
conventional mathematical notation, enables the empirically curious 
reader to instantly validate our claims and encourage her/him to 
independently explore their premises and their consequences. 
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Appendix 

An Embedded Data Transformation Language 

We will describe briefly the embedded data transformation lan- 
guage used in this paper as a set of operations on a groupoid of 
isomorphisms. We refer to ( |16| , |17| ) for details. 

The Groupoid of Isomorphisms: We implement an isomorphism 
between two objects X and Y as a Haskell data type encapsulating 
a bijection / and its inverse g. 



We will call the from function the first component (a section in 
category theory parlance) and the to function the second component (a 
retraction) defining the isomorphism. The isomorphisms are naturally 
organized as a groupoid. 

data ISO a b = Iso (a— >b) (b— 5>a) i 
from (Iso f _) = f 
to (Iso _ g) = g 

compose : : Iso a b — s> Iso b c — s> Iso a c 

compose (Iso f g) (Iso f g' ) = Iso (f . f) (g . g' ) 

itself = Iso id id 

invert (Iso f g) = Iso g f ^ 

Assuming that for any pair of type Iso a b, .fog = idi, and 
g o f — ida, we can now formulate laws about these isomorphisms. 
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The data type I s o has a groupoid structure, i. e. the compose 
operation, when defined, is associative, itself acts as an identity 
element and invert computes the inverse of an isomorphism. 

The Hub: Sequences of Natural Numbers: To avoid defining 
"'■"^""'"^ isomorphisms between n objects, we clioose a Hub object 
to/from which we will actually implement isomorphisms. 

Choosing a Hub object is somewhat arbitrary, but it makes sense 
to pick a representation that is relatively easy convertible to various 
others and scalable to accommodate large objects up to the runtime 
system's actual memory limits. 

We will choose as our Hub object sequences of natural numbers. 
We will represent them as lists i.e. their Haskell type is [N] . 

type N = Integer 
type Hub = [N] 

We can now define an Encoder as an isomorphism connecting an 
object to Hub 

type Encoder a = Iso a Hub 

together with the combinator "as", providing an embedded transfor- 
mation language for routing isomorphisms through two Encoders. 

as : : Encoder a — > Encoder b — > b — > a 
as that this x = g x where Iso _ g = 
compose that (invert this) 

The combinator "as" adds a convenient syntax such that converters 
between A and B can be designed as: 

a2b X = as B A X 
b2a X = as A B X 



a2h ^ as B A 




Hub 



Given that [N] has been chosen as the root, we will define our 
sequence data type list simply as the identity isomorphism on 
sequences in [ N ] . 

list : : Encoder [N] 
list = itself 

The Encoder mset for multisets of natural numbers is defined as: 

mset : : Encoder [N] 

mset = Iso mset21ist list2mset 



Examples of pairing functions derived from characteristic functions 

The function Syracuse is used in an equivalent formulation of 
the CoUatz conjecture. Interestingly, it can be computed using the 
nAdicTail which results after dividing a number n with fi2{n). 
Note that we derive our pairing function directly from the list 
representation of the range of this function as genericPair and 
genericUnpair implicitly construct the associated characteristic 
function. 

Syracuse : : N— >N 

Syracuse n = nAdicTail 2 (6*n+4) 

nsyr = [0] 

nsyr n = n : nsyr (Syracuse n) 
syrnats = map Syracuse [0. . ] 

syrpair = genericPair list syrnats j 
syrunpair = genericUnpair list syrnats 

Figures[T]and[2]show the "Z-order" (Morton code) path connecting 
successive values in the range of the function bunpair 2 and 
bunpair 3. Figures |3] and |4] show the path connecting the values 
in the range of unpairing functions associated, respectively to the 
Syracuse function and the binary digits of n. Interestingly, at a 
first glance, some regular patterns emerge even in the case of such 
notoriously irregular characteristic functions. 




^ 10 15 20 25 30 35 



Fig. 1. Path connecting values of bunpair 2 



mset21ist, list2mset : : [N] -> [N] 
mset21ist xs = zipWith (-) (xs) (0:xs) 
list2mset ns = tail (scanl (+) ns) 

The Encoder set for sets of natural numbers is defined as: 



set : : Encoder [N] 

set = Iso set21ist list2set 



set21ist, list2set : : [NH> [N] 

list2set = (map pred) . Iist2mset . (map succ) 

set21ist = (map pred) . mset21ist . (map succ) 

Note that these converters between lists, multisets and sets make no 
assumption about finiteness of their arguments and therefore they can 
used in a non-strict language like Haskell on infinite objects as well. 





Fig. 4. Path connecting values of an unpairing bijection based on binary 
digits of TT 



